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Abstract - In this paper, one studies the famous well-known and challenging Tweety Penguin Triangle Problem (TPTP or TP2) 
pointed out by Judea Pearl in one of his books. We first present the solution of the TP2 based on the fallacious Bayesian reasoning and 
prove that reasoning cannot be used to conclude on the ability of the penguin-bird Tweety to fly or not toffy. Then we present in details 
the counter-intuitive solution obtained from the Dempster-Shafer Theory (DST). Finally, we show how the solution can be obtained 
with our new theory of plausible and paradoxical reasoning (DSmT). 
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1 Introduction 

Judea Pearl claimed that DST of evidence fails to provide a reasonable solution for the combination of evidence even 
for apparently very simple fusion problem 1121 1131 . Most criticisms are answered by Philippe Smets in 1231 1241 . The 
Tweety Penguin Triangle Problem (TP2) is one of the typical exciting and challenging problem for all theories managing 
uncertainty and conflict because it shows the real difficulty to maintain truth for automatic reasoning systems when the 
classical property of transitivity (which is basic to the material-implication) does not hold. In his book 1 12 1, Judea Pearl 
presents and discusses in details the semantic clash between Bayes vs. Dempster-Shafer reasoning. We present here our 
new analysis on this problem and provide a solution of the Tweety Penguin Triangle Problem based on our new theory of 
plausible and paradoxical reasoning, known as DSmT (Dezert-Smarandache Theory). We show how this problem can be 
attacked and solved by our new reasoning with help of the (hybrid) DSm rule of combination 1211 . 

The purpose of this paper is not to browse all approaches available in literature for attacking the TP2 problem but 
only to provide a comparison of the DSm reasoning with respect to the Bayesian reasoning and to the plausible reasoning 
of DST framework. Interesting but complex analysis on this problem based on default reasoning and e-belief functions 
can be also found by example in 1231 and QJ. Other interesting and promising issues for the TP2 problem based on the 
fuzzy logic of Zadeh 1 26 1 jointly with the theory of possibilities 1 6 7 1 are under investigations. Some theoretical research 
works on new conditional event algebras (CEA) have emerged in literature 1 8 1 since last years and could offer a new track 
for attacking the TP2 problem although unfortunately no clear didactic, simple and convincing examples are provided to 
show the real efficiency and usefulness of these theoretical investigations. 



2 The Tweety Penguin Triangle Problem 

This very important and challenging problem, as known as the Tweety Penguin Triangle Problem (TP2) in literature, is 
presented in details by Judea Pearl in [12|. We briefly present here the TP2 and the solutions based first on fallacious 
Bayesian reasoning and then on the Dempster-Shafer reasoning. We will then focus our analysis of this problem from the 
DSmT framework and the DSm reasoning. 

Let's consider the set R = {r\, r$} of given rules: 

• fx: Penguins normally don't fly'' <^> (p — > -if) 

• r 2 : "Birds normally fly" (b — > /) 

• r 3 : "Penguins are birds" <^> (p — > b) 
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To emphasize our strong conviction in these rules we commit them some high confidence weights wi, W2 and W3 in [0, 1] 
with W\ = 1 — ei, W2 = 1 — £2 an d 11)3 = 1 (where t\ and £2 are small positive quantities). The conviction in these rules 
is then represented by the set W — {wi, W2, W3} in the sequel. 

Another useful and general notation adopted by Judea Pearl in the first pages of his book 1 12 1 to characterize these 
three weighted rules is the following one (where wt, W2,W3 € [0, 1]): 

ri:p->[-<f) r 2 :b^j r 3 : p -> b 

When wi,W2,W3 6 {0, 1} the classical logic is the perfect tool to conclude on the truth or on the falsity of a propo- 
sition built from these rules based on the standard propositional calculus mainly with its three fundamental rules (Modus 
Ponens, Modus Tollens and Modus Barbara - i.e. transitivity rule). When < W\,W2, W3 < 1, the classical logic can't be 
applied because the Modus Ponens, the Modus Tollens and the Modus Barbara do not longer hold and some other tools 
must be chosen. This will discussed in detail in section I3T2I 

Question: Assume we observe an animal called Tweety (T) that is categorically classified as a bird (b) and a penguin (p), 
i.e. our observation is O = [T = (b n p)] = [(T = b) H (T = p)]. The notation T = {bC\p) stands here for "Entity T 
holds property (b fl p)". What is the belief (or the probability - if such probability exists) that Tweety can fly given the 
observation O and all information available in our knowledge base (i.e. our rule-based system R and W) ? 

The difficulty of this problem for most of artificial reasoning systems (ARS) comes from the fact that, in this example, 
the property of transitivity, usually supposed satisfied from material-implication interpretation HI 21 . (p — * b, b — > /) => 
(p —> /) does not hold here (see section l3T2i . In this interesting example, the classical property of inheritance is thus 
broken. Nevertheless a powerful artificial reasoning system must be able to deal with such kind of difficult problem and 
must provide a reliable conclusion by a general mechanism of reasoning whatever the values of convictions are (not only 
restricted to values close to either or 1). We examine now three ARS based on the Bayesian reasoning 1 12 1 which turns 
to be fallacious and actually not appropriate for this problem and we explain why, on the Dempster-Shafer Theory (DST) 
11171 and on the Dezert-Smarandache Theory (DSmT) 1211 . 

3 The fallacious Bayesian reasoning 

We first present the fallacious Bayesian reasoning solution drawn from the J. Pearl's book in (,12 1 (pages 447-449) and 
then we explain why the solution which seems at the first glance correct with intuition is really fallacious. We then explain 
why the common rational intuition turns actually to be wrong. 



3.1 The Pearl's analysis 

To preserve mathematical rigor, we introduce explicitly all information available in the derivations. In other words, one 
wants to evaluate using the Bayesian reasoning, the conditional probability, if it exists, P(T — f\0, R, W) = P(T = 
f\T = p,T = b,R,W). The Pearl's analysis is based on the assumption that a conviction on a given rule can be 
interpreted as a conditional probability (see [12j page 4). In other words if one has a given rule a — > b with w E [0, 1] 
then one can interpret, at least for the calculus, w as P(b\a) and thus the probability theory and Bayesian reasoning can 
help to answer to the question. We prove in the following section that such model cannot be reasonably adopted. For now, 
we just assume that such probabilistic model holds effectively as Judea Pearl does. Based on this assumption, since the 
conditional term/information (T = p,T = b, R, W) is strictly equivalent to (T = p, R, W) because of the knowledge of 
rule r3 with certainty (since W3 = 1), one gets easily the fallacious intuitive expected Pearl's result: 

P{T = f\0, R, W) = P(T = f\T = p,T = 6, R, W) 
PIT = f\0, R, W) = P(T = f\T= p, R, W) 
P(T = f\0, R,W) = 1- P(T = -n/|T = p, R, W) 
P(T = f\0,R,W) =l-w 1 = e 1 

From this simple analysis, the Tweety 's "birdness" does not render her a better flyer than an ordinary penguin as intuitively 
expected and the probability that Tweety can fly remains very low which looks normal. We reemphasize here the fact, that 
in his Bayesian reasoning J. Pearl assumes that the weight w\ for the conviction in rule r\ can be interpreted in term of a 
real probability measure P(-if\p). This assumption is necessary to provide the rigorous derivation of P(T = f\0, R, W). 
It turns out however that convictions Wi on logical rules cannot be interpreted in terms of probabilities as we will prove in 
the next section. 
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When rule r 3 is not asserted with absolute certainty (i.e. 11)3 = 1) but is subject to exceptions, i.e. W3 = 1 — 63 < 1, 
the fallacious Bayesian reasoning yields (where notations T = f,T = b and T = p are replaced by /, b and p for notation 
convenience): 

P(f\0,R,W) = P(f\p,b,R,W) 
P{f,p,b\R, W) 



P(f\0,R,W) = 
P(f\0,R,W) = 



P{p,b\R, W) 
P(f,b\P,R, W)P( P \R,W) 



P{b\p, R, W)P{p\R, W) 
By assuming P(p\R, W) > 0, one gets after simplification by P(p\R, W) 

P(f,b\p,R,W) 



P(f\0,R,W) 
P(f\0,R,W) = 



P(b\p, R, W) 

P(b\f,p,R,W)P(f\p,R,W) 



P(b\p, i?, W) 

If one assumes P(b\p, R, W) = w 3 = 1 — e 3 and P{f\p, i?, W) = 1 - P(~<f\p, R, W) = 1 — Wi = t\, one gets 

P(J\0, R, W) = P(b\f,p, R, W) x -5— 

J- _ e 3 

Because < P(b\f,p, R, W) < 1, one finally gets the Pearl's result 021 (P-448) 

P(f\0,R,W)<-^— (1) 

1 - £3 

which states that the observed animal Tweety (a penguin-bird) has a very small probability of flying as long as £3 re- 
mains small, regardless of how many birds cannot fly (£2), and has consequently a high probability of not flying because 
P(f\0, R, W) + P(f\0, R, W) = 1 since the events / and / are mutually exclusive and exhaustive (assuming that the 
Pearl's probabilistic model holds ... ). 

3.2 The weakness of the Pearl's analysis 

We prove now that the previous Bayesian reasoning is really fallacious and the problem is truly undecidable to conclude 
about the ability of Tweety to fly or not to fly if a deep analysis is done. Actually, the Bayes' inference is not a classical 
inference [3 |. Indeed, before applying blindly the Bayesian reasoning as in the previous section, one first has to check 
that the probabilistic model is well-founded to characterize the convictions of the rules of the rule-based system under 
analysis. We prove here that such probabilistic model doesn't hold for a suitable and useful representation of the problem 
and consequently for any problems based on the weighting of logical rules (with positive weighting factors/convictions 
below than 1). 

3.2.1 Preliminaries 

We just remind here only few important principles of the propositional calculus of the classical Mathematical Logic which 
will be used in our demonstration. A simple notation, which may appear as unusual for logicians, is adopted here just for 
convenience. A detailed presentation of the propositional calculus and Mathematical Logic can be easily found in many 
standard mathematical textbooks like 1 1*51 II lllTol . Here are these important principles: 

• Third middle excluded principle : A logical variable is either true or false, i.e. 

a V ->a (2) 

• Non-contradiction law : A logical variable can't be both true and false, i.e. 

n(aAna) (3) 

• Modus Ponens : This rule of the propositional calculus states that if a logical variable a is true and a — > b is true, 
then b is true (syllogism principle), i.e. 

(a A (a — > b)) — > b (4) 

• Modus Tollens : This rule of the propositional calculus states that if a logical variable -^b is true and a — > b is true, 
then -^a is true, i.e. 

(-.6 A (a -> b)) -> -na (5) 
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• Modus Barbara : This rule of the propositional calculus states that if a — > b is true and b — > c is true then a — > c 
is true (transitivity property), i.e. 

((a -> 6) A (6 -> c)) -► (a ->• c) (6) 

From these principles, one can prove easily, based on the truth table method, the following property (more general 
deducibility theorems in Mathematical Logic can be found in 1 19 20 1) : 

((a — ► 6) A (c — >• d)) — » ((a A c) — ► (6 A d)) (7) 
3.2.2 Analysis of the problem when e\ = £2 = £3 = 

We first examine the TP2 when one has no doubt in the rules of our given rule-based systems, i.e. 

1 (-/) 
~ X b 

From rules r\ and T2 and because of property {7}, one concludes that 

pAb^(fA^f) 

and using the non-contradiction law Q with the Modus Tollens 0, one finally gets 

"-(/A -./)--.(? A 6) 

which proves that p A 6 is always false whatever the rule r% is. Interpreted in terms of the probability theory, the event 
T = pp\b corresponds actually and truly to the impossible event since T = f and T = f are exclusive and exhaustive 
events. Under such conditions, the analysis proves the non-existence of the penguin-bird Tweety. 

If one adopts the notations 1 of the probability theory, trying to derive P(T — f\T — pDb) and P(T = f\T = pDb) 
with the Bayesian reasoning is just impossible because from one of the axioms of the probability theory, one must have 
P(0) = and from the conditioning rule, one would get expressly for this problem the indeterminate expressions: 

P(T=/|T = 0) 
P(T = /n0) 



and similarly 



P(T = 


f\T-- 


= pnb) 


P(T = 


f\T-- 


= pnb) 


P{T = 


f\T-- 


= pnb) 


P(T = 


f\T-- 


= pnb) 


P(T = 


f\T-- 


= pnb) 


P(T = 


f\T-- 


= pnb) 


PIT = 


f\T-- 


= pnb) 


P(T = 


f\T-- 


= pnb) 


ten < 


ei,e 2 


e 3 < 1 



P(T = 2 
P(T = 0) 
P{T = 0) 





(indeterminate) 



P(T = /ni 
P(T = 0) 
P(T = 0) 
P(T = 0) 




Q 



(indeterminate) 



Let's examine now the general case when one allows some little doubt on the rules characterized by taking t\ > , 62 > 
and 63 > and examine the consequences on the probabilistic model on these rules. 



'Because probabilities are related to sets, we use here the common set-complement notation / instead of the logical negation 
notation -1/, n for A and U for V if necessary. 
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First note that, because of the third middle excluded principle and the assumption of the existence of a probabilistic 
model for a weighted rule, then one should be able to consider simultaneously both "probabilistic/Bayesian" rules 



P(b\a)=w 

a — > b 

P(b\a) = l-w , W 

— > -ib 

In terms of classical (objective) probability theory, these weighted rules just indicate that in 100 x w percent of cases the 
logical variable b is true if a is true, or equivalently, that in 100 x w percent of cases the random event b occurs when the 
random event a occurs. When we don't refer to classical probability theory, the weighting factors w and 1 — w indicate 
just the level of conviction committed to the validity of the rules. Although very appealing at the first glance, this prob- 
abilistic model hides actually a strong drawback/weakness specially when dealing with several rules as shown right below. 

Let's prove first that from a "probabilized" rule a P< - b i^ w b one cannot assess rigorously the convictions onto its 
Modus Tollens. In other words, from (El what can we conclude on 

n& ^ (9) 

P(a|6)=? W 
— ► —id 

From the Bayes' rule of conditioning (which must hold if the probabilitic model holds), one can express P(a\b) and 
P(a\b) as follows 

f P(a\b) = 1 - P(a\b) = 1 - = 1 - «M 



l-P(b) l-P(b) 
P(anb) _ 1 _ P(b\a)F 
P(b) - P(b) 

or equivalently by replacing P(b\a) and P(b\a) by their values w and 1 — w, one gets 



P(a\b) = 1 - P(a\b) = 1 - = 1 - P(b|a)P(a) 



' P(a\b) = 1 - (1 - w)-^- 
P(a\b) = l-w$$ 



pW> (10) 



These relationships show that one cannot fully derive in theory P(a\b) and P(a\b) because the prior probabilities P(a) 
and P(b) are unknown. 

A simplistic solution, based on the principle of indifference, is then just to assume without solid justification that 
P(a) = P(a) = 1/2 and P(b) = P(b) = 1/2. With such assumption, then one gets the following estimates P(a\b) = w 
and P(a\b) = 1 — w for P(a\b) and P(a\b) respectively and we can go further in the derivations. 

Now let's go back to our Tweety Penguin Triangle Problem. Based on the probabilistic model (assumed to hold), one 
starts now with both 

P(/|p) = l- ei ( P(/|p)=ei 

n : p -v -./ \P J 

r 2 :b P{flb) ^f \b P m^^f (11) 

P(6|p)=l-6 3 P(b|p)=£3 , 

S3 ■ P b \p — > 

Note that taking into account our preliminary analysis and accepting the principle of indifference, one has also the two 
sets of weighted rules either 

' , P(p|/) = l-£l f , P(p|/)=ei 

V^ 17 ^ 1 - 62 ^ / P ^= e2 ^ (12) 

P(p|6)=l-e 3 P(p|6)=e 3 

-ib — > -ip I o — > -ip 

One wants to assess the convictions (assumed to correspond to some conditional probabilities) into the following rules 

pA6 -» / (13) 



P(/|pnh)=? 

pAo -./ (14) 

The question is to derive rigorously P(f\p n b) and P(/|p PI 6) from all previous available information. It turns out that 
the derivation is impossible without unjustified extra assumption on conditional independence. Indeed, P(f\p D b) and 
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P(f\p n b) are given by 

P(p.b) P(b\p)P(p) 



f P (f\ p n b) - - Ml 



(15) 

{-ryj\pno)— p( p() ) — p(b\p)p{ p ) 

If one assumes as J. Pearl does, that the conditional independence condition also holds, i.e. P(p, = P(p\f)P(b\f) 
and P(p,b\f) = P(p\f)P(b\f), then one gets 

By accepting again the principle of indifference, P(/) = -P(/) = 1/2 and P(p) = P(p) = 1/2, one gets the following 
expressions 

'^(/|pnft) = ^P 

(16) 

Replacing probabilities P(p|/), P(&|/), P(6|p), P(p\f) and P(b\f) by their values in the formula dl6> . one finally gets 

(17) 

,i i (/>n6) = ii T ^ 

Therefore we see that, even if one accepts the principle of indifference together with the conditional independence 
assumption, the approximated "probabilities" remain both small and do not correspond to a real measure of probability 
since the conditional probabilities of exclusive elements / and / do not add up to one. When e\, e 2 and £3 tends towards 
0, one has 

P(f\pnb) + P(f\ P nb)no 

Actually our analysis based on the principle of indifference, the conditional independence assumption and the model pro- 
posed by Judea Pearl, proves clearly the impossibility of the Bayesian reasoning to be applied rigorously on such kind of 
weighted rule-based system, because no probabilistic model exists for describing correctly the problem. This conclusion 
is actually not surprising taking into account the Lewis' theorem 1 14 1 explicated in details in 1 8 1 (chapter 11). 



Let's now explain the reason of the error in the fallacious reasoning which was looking coherent with the common 
intuition. The problem arises directly from the fact that penguin class and bird class are defined in this problem only 
with respect to the "flying" and "not-flying" properties. If one considers only these properties, then none Tweety animal 
can be categorically classified as a penguin-bird, because penguin-birdness doesn't not hold in reality based on these 
exclusive and exhaustive properties (if we consider only the information given within the rules r\, r 2 and r^). Actually 
everybody knows that penguins are effectively classified as bird because "birdness" property is not defined with respect to 
the "flying" or "not-flying" abilities of the animal but by other zoological characteristics C (birds are vertebral oviparous 
animals with hot blood, a beak, feather and anterior members are wings) and such information must be properly taken 
into account in the rule-based systems to avoid to fall in the trap of such fallacious reasoning. The intuition (which seems 
to justify the fallacious reasoning conclusion) for TP2 is actually biased because one already knows that penguins (which 
are truly classified as birds by some other criterions) do not fly in real world and thus we commit a low conviction (which 
is definitely not a probability measure, but rather a belief) to the fact that a penguin-bird can fly. Thus the Pear'ls analysis 
proposed in 1 12 1 appears to the authors to be unfortunately incomplete and somehow fallacious. 



4 The Dempster- Shafer reasoning 

As pointed out by Judea Pearl in 1 12 1, the Dempster-Shafer reasoning yields, for this problem, a very counter-intuitive 
result: birdness seems to endow Tweety with extra flying power ! We present here our analysis of this problem based on 
the Dempster-Shafer reasoning. 



Let's examine in detail the available prior information summarized by the rule r±: "Penguins normally don't fly" <^ 
(p — ► -1/) with the conviction wi = 1 — ei where t\ is a small positive number close to zero. This information, in the 
DST framework, has to be correctly represented in term of a conditional belief Beli(/|p) = 1 — ei rather than directly 



the mass mi (/ fl p) = 1 — e\ . 



Choosing Beli(/|p) = 1 — t\ means that there is a high degree of belief that a penguin-animal is also a nonfiying- 
animal (whatever kind of animal we are observing). This representation reflects perfectly our prior knowledge while the 
erroneous coarse modeling based on the commitment mi (/ Dp) = 1 — t\ is unable to distinguish between rule n and 
another (possibly erroneous) rule like r[ : {—if — > p) having same conviction value w\. This correct model allows us to 
distinguish between n and r[ (even if they have the same numerical level of conviction) by considering the two different 
conditional beliefs Beli (f\p) = 1 — t\ and Bel]/ (p\f) = 1 — ei. The coarse/inadequate basic belief assignment model- 
ing (if adopted) in contrary would make no distinction between those two rules r\ and r[ since one would have to take 
m i if H p) = my (p fl /) and therefore cannot serve as the starting model for the analysis 

Similarly, the prior information relative to rules r 2 : {b — > /) and r 3 : [p — > b) with convictions w 2 = 1 — £2 and 
w 3 = 1 — £ 3 has to be modeled by the conditional beliefs Bel 2 (/|6) = 1 — e 2 and Bel 3 (6|p) = 1 — e 3 respectively. 

The first problem we have to face now is the combination of these three prior information characterized by Beli (f\p) = 
1 — e\, Bel2(/|6) = 1 — £2 and Bei3(&|p) = 1 — 63. All the available prior information can be viewed actually as three in- 
dependent bodies of evidence B\, B 2 and £> 3 providing separately the partial knowledges summarized through the values 
of Beli(/|p), Bel 2 (/|6) and Bel 3 (6|p). To achieve the combination, one needs to define complete basic belief assign- 
ments mi(.), m 2 (.) and m 3 (.) compatible with the partial conditional beliefs Beli(/|p) = 1 — ei, Bel 2 (/|6) = 1 — t 2 
and Bel 3 (6|p) = 1 — £3 without introducing extra knowledge. We don't want to introduce in the derivations some extra- 
information we don't have in reality. We present in details the justification for the choice of assignment mi (.). The choice 
for m 2 {.) and m 3 (.) will follow similarly. 

The body of evidence B\ provides some information only about / and p through the value of Beli (/If?) and without 
reference to b. Therefore the frame of discernment 61 induced by B\ and satisfying the Shafer's model (i.e. a finite set of 
exhaustive and exclusive elements) corresponds to 



6i - {e l ^ fnp, e 2 4 / np,e 3 ± fn P , e 4 = fn P } 



schematically represented by 



f = 8 2 Ud 4 < 



04 = fr\ P 




3 = fn P 


62 = fnp 




0i = fnp 



I = e l u e 3 



p=e 1 u9 2 



The complete basic assignment mi (.) we are searching for and defined over the power set 2 01 which must be compatible 
with Beli(/[p) is actually the result of the Dempster's combination of an unknown (for now) basic belief assignment 
m\ (.) with the particular assignment m'/(.) defined by m'{{p = 63 U 64) = 1; in other worlds, one has 



mi(.) 



,m'i'](.) 



From now on, we introduce explicitly the conditioning term in our notation to avoid confusion and thus we use m\{.\p) = 
mi (.|#3 U 64) instead mi(.). From m'{ {p = 63 U 6*4) = 1 and from any generic unknow basic assignment m[{.) defined 
by its components m' 1 (0) = 0, mi(0i), m[{9 2 ), m[{9 3 ), m[{9 4 ), m[{6i U 2 ), m[(di U 63), m[(di U 4 ), m\{B 2 U 63), 
mi (6 2 U64), m[ (6> 3 U 64), m[ (6»i U 9 2 U 63), m[ (6>i U B 2 U 64), m[ {6 X U 63 U 9 4), m[ (9 2 U9 3 U9 4 ), m[ (6>i U 9 2 U 9 3 U 4 ) 
and applying Dempter's rule, one gets easily the following expressions for mi(.|03 U 64). All mi(.|0 3 U 64) masses are 
zero except theoretically 



mi(0 3 |0 3 U 4 ) = mi'(0 3 U 4 )K(0 3 ) + mi(0i U 3 ) 
+ mi(0 2 U0 3 ) 
+ mi(0iU0 2 U0 3 )]/ifi 



mi(0 4 |0 3 U 4 ) = m" (0 3 U 4 )[mi(0 4 ) + mi(0i U 4 ) 
+ mi(0 2 U0 4 ) 
+ mi(0iU0 2 U0 4 )]/if 1 



mi(0 3 U 4 |0 3 U 4 ) = m'l (0 3 U 4 )[mi(0 3 U 4 ) 

+ m[{6 1 u0 3 u0 4 ) 
+ m' 1 {9 2 ue 3 ue 4 ) 

+ m' 1 (9 1 U0 2 U0 3 U0 4 )}/K 1 

with 

1 

Ki = 1 - m" (0 3 U 4 )[mi(0i) + *ni(ftj) + mi(0j U 2 )] 

To complete the derivation of mi(.|03 U 4 ), one needs to use the fact that one knows that Beli(/|p) = 1 — ei which, 
by definition 1171 . is expressed by 

Beli(/» = Bel 1 (0iU0 3 |0 3 U0 4 ) 

Bel!(/» = toi (0\ J $3 U 4 ) + mi (0 3 \0 3 U 4 ) 

+ m 1 (0 1 U0 3 \0 3 U0 4 ) 
Bdi(/lp) = l-d 

But from the generic expression of mi(.|0 3 U 6*4), one knows also that 7711(6*1 |03 U 4 ) = and mi(0i U 6*3! #3 U 6 I 4) = 0. 
Thus the knowledge of Beli (f\p) = 1 — ei implies to have 

mi(0 3 |03 U 9 4 ) = [mi (d 3 ) + mi (ft U 3 ) 
+ mi(0 2 U0 3 ) 
+ mi(0i U0 2 U0 3 )]/^i 

TOi(0 3 |03U0 4 ) = 1-61 

This is however not sufficient to fully define the values of all components of mi(.|03 U 4 ) or equivalently of all 
components of m[(.). To complete the derivation without extra unjustified specific information, one needs to apply the 
minimal commitment principle (MCP) which states that one should never give more support to the truth of a proposition 
than justified |9|. According to this principle, we commit a non null value only to the less specific proposition involved 
into mi (03 1 3 U 4 ) expression. In other words, the MCP allows us to choose legitimately 

mi(0i) =mi(0 2 ) =TOi(0 3 ) = 
mi (0i U 2 ) = mi (0i U 9 a ) = mi(0 2 U 3 ) = 
to' 1 (0 1 U0 2 U03)/O 

Thus K\ = 1 and mi(03|03 U 4 ) reduces to 

mi(0 3 |03 U 4 ) = mi (0i U 2 U 3 ) = 1 - ei 

Since the sum of basic belief assignments must be one, one must also have for the remaining (uncommitted for now) 
masses of mi(.) the constraint 

mi(0 4 ) + mi (0i U 4 ) + mi (0 2 U 4 ) + mi(0i U 2 U 4 ) 
+mi(0 3 U 4 ) + mi (0i U 3 U 4 ) + mi(0 2 U 3 U 4 ) 

+?77i(0lU0 2 U0 3 U0 4 ) =£1 

By applying a second time the MCP, one chooses mi(0i U 2 U 03 U 4 ) = e 4 . 

Finally, the complete and less specific belief assignment mi(.\p) compatible with the available prior information 
Beli(/|p) = 1 — ei provided by the source B\ reduces to 

toi(0 3 |03 U 4 ) = mi (01 U 2 U 3 ) - 1 - 61 (18) 

mi (0 3 U0 4 |0 3 U0 4 ) =mi(0i U0 2 U0 3 U0 4 ) = e a (19) 

or equivalently 

mi (/" D p\p) = mi (p U /) = 1 - ej (20) 

m 1 (p\p)=m' 1 (pUfUpUf) = e 1 (21) 



Q 



It is easy to check, from the mass mi (.\p), that one gets effectively Beli(/|p) = 1 — e\. Indeed: 

Bdi(/lp)=Bdi(fl 1 Ufl 3 b) 
Beli(/» = Beli((/np) U (fn P )\ P ) 
Beli(/|p) = mi(/np|p) +mi(/np|p) 

S v ' 

o 

+ mi((/np)u(/np)|p) 



Bel 1 (/»=m 1 (/npb) 
Bdi(/|p) = l-ei 

In a similar way, for the source B 2 with 6 2 defined as 

e 2 = {e 1 ±fnb,6 2 ±bnf,e 3 ±fnb,8 4 ±fnb} 



schematically represented by 



f = 9 2 U9 4 { 



4 = /n& 




o 3 = fnb 


02 = fnb 




Oi = f n b 



b=9 1 l)6 2 



one looks for m 2 (. | b) = [m' 2 ®m 2 }{.) withm 2 '(b) = m 2 (6 3 U0 4 ) = 1. From the MCP, the condition Bel 2 (/| b) = l-e 2 
and with simple algebraic manipulations, one finally gets 



or equivalently 



m 2 {6 3 \6 3 U 6i) = m' 2 {e-i U 6 2 U 6 3 ) = 1 - e 2 
m 2 {6 3 U 6> 4 16> 3 U 6 4 ) = m 2 (6 1 U 6 2 U 6> 3 U 6> 4 ) = e 2 



m 2 (/n6|fe) =m' 2 (6U/) = 1 - e 2 
m 2 {b\b)=m' 2 {bUfUbUf) = e 2 



(22) 
(23) 



(24) 
(25) 



In a similar way, for the source B 3 with 3 defined as 

e 3 = {e 4 4 6 n p, 6> 2 4 b n p, e 3 = P n 6, 6> 4 = b n p} 



schematically represented by 



p= e 3 u6i4 



6 = e 2 ue 4 < 



9 4 = bC)p 




9 3 = bn P 


9 2 = bf]p 




e 1 = br\p 



>b = e 1 u <9 3 



p=eiu# 2 



one looks for m 3 (.|p) = [m' 3 8 m 3 ](.) with m 3 (p) = m 3 (0 3 U6 4 ) = 1. From the MCP, the condition Bel 3 (6|p) = 1 - e 3 
and with simple algebraic manipulations, one finally gets 



or equivalently 



m 3 (8 3 \8 3 U 9 4 ) = m 3 (^ U 9 2 U 3 ) = 1 - e 3 
m 3 (6» 3 U 6» 4 |6» 3 U 9 4 ) = m' 3 (6 1 U 2 U 6 3 U 4 ) = e 3 



m 3 (6np|p) = m 3 (pU6) = 1 -e 3 
m 3(p\p) = m' 3 (b U p Li b U p) = e 3 



(26) 
(27) 



(28) 
(29) 



Since all the complete prior basic belief assignments are available, one can combine them with the Dempster's 
rule to summarize all our prior knowledge drawn from our simple rule-based expert system characterized by rules 



o 



R = {r*i, r 2 , r 3 } and convictions/confidences W = {iui, 102, 103} in these rules. 



The fusion operation requires to primilarily choose the following frame of discernment 9 (satisfying the Shafer's 
model) given by 

9 = {6>i, 62, 6*3, 64, 65, 9e, 67, ^s} 

where 

91 = fnbn P e 5 = fnbn P 

9 2 = fnbnp 6 e = fr\br\p 
e 3 = fnbnp e 7 = jc]bc] P 
9 4 = fnbnp e s = fnbnp 

The fusion of masses mi(.) given by eqs. J20t - d2 II with m2(.) given by eqs. d24l - J25i using the Demspter's rule of 
combination 1 17 1 yields mi2(.) = [mi © Ti2](-) with the following non null components 

m 12 {fr\br\p) =ei(l -e 2 )/Ki2 
m 12 {fr\br\p) = e 2 (l - ei)/ifi2 
mi 2 (& Dp) = eie 2 /^i2 

with iTi2 = 1 - (1 - ei)(l - e 2 ) = ei + e 2 - e\e 2 . 

The fusion of all prior knowledge by the Dempster's rule toi23(.) = [mi m2 ®ms](.) = [mi2 ©ms](.) yields the 
final result : 

mi23(/ n b n p) = mi 23 (6 , i) = ei(l - e 2 )/i^i23 
mi23(/n6np) = mi 23 (6 , 5) = £2(1 -ei)/-Ki23 
mi 23 (br\p) = mi23(#i U6» 5 ) = eie 2 /i^i23 

with K i23 = K12 = 1 - (1 - ei)(l - e 2 ) = ei + £2 - £i£2- 

which defines actually and precisely the conditional belief assignment mi23(.|p H b). It turns out that the fusion with the 
last basic belief assignment 7713 (.) brings no change with respect to previous fusion result mi2(.) in this particular problem. 

Since we are actually interested to assess the belief that our observed particular penguin-animal named Tweety (de- 
noted as T = (pP\b)) can fly, we need to combine all our prior knowledge m 12 3(.) drawn from our rule-based system with 
the belief assignment m (T = (p PI b)) = 1 characterizing the observation about Tweety. Applying again the Demspter's 
rule, one finally gets the resulting conditional basic belief function m ol23 — [m © m^] (.) defined by 

m ol23 (T= (/n&np)|T= (pnb)) = e x {l- e 2 )/K l2 
m ol23 {T=(fnbn P )\T=( P nb)) = e 2 (l-e 1 )/K 12 
m ol23 (T = (b n p)|T = (p n b)) = eie 2 /^i2 

From the Dempster-Shafer reasoning, the belief and plausibity that Tweety can fly are given by 1171 
Bd(T = /|T = (pn&)) = 

E 

x£2 e ,xCf 



^ m ol23 (T = x\T={ P nb)) 



Pi(T = f\T=( P nb)) = 

m ol23 (T = x\T={ P nb)) 

x€2 e ,2;rW0 

Because / = [(/ fl b Pip) U (/ D b Dp) U (/ n b Hp) U (/ D b Hp)] and the specific values of the masses defining m i 2 3(.)> 
one has 

Bel(T = /|T = (pn6)) = 

m 0l23 (T= {fnbn P )\T= (pnb)) 



1 n 



Fl(T=f\T=(pnb)) = 

m 0l23 {T = (fr\bn P )\T=( P r\b)) 



+ m ol2 3(T=(bnp)\T=(pnb)) 



and finally 



Bel(r = /|T=(pn6))= t2) (30) 



K 



12 



H (r = /|r = (pn6)) = ^^ + ^ = -^- (3i) 

A12 A12 A12 

In a similar way, one will get for the belief and the plausibility that Tweety cannot fly 

Bel(T = f\T = (p n 6)) = 62(1 ~ £l) (32) 

-^12 

P1(T = j\T = (p n 6)) = ^f^- + ^ = ^- (33) 
Using the first order approximation when ei and £2 are very small positive numbers, one gets finally 



Bel(T = f\T = (pfl 6)) = P1(T = /|T = (p n b)) 
In a similar way, one will get for the belief that Tweety cannot fly 

Bel(T = f\T = (p n b)) = P1(T = /|T = (p n 6)) 



ei + £2 



£2 

ei + £2 



This result coincides with the Judea Pearl's result but a different analysis and detailed presentation has been done here. 
It turns out that this simple and complete analysis corresponds actually to the ballooning extension and the generalized 
Bayesian theorem proposed by Smets in \22 25 \ and discussed by Shafer in 1 1 8 1 although it was carried out independently 
of Smets' works. As pointed out by Judea Pearl, this result based on DST and the Dempster's rule of combination looks 
very paradoxical/counter-intuitive since it means that if nonflying birds are very rare, i.e. £2 ~ 0, then penguin-birds like 
our observed penguin-bird Tweety, have a very big chance of flying. As stated by Judea Pearl in 1 12\ pages 448-449: 
"The clash with intuition revolves not around the exact numerical value of Bel(f) but rather around the unacceptable 
phenomenon that rule r 3 , stating that penguins are a subclass of birds, plays no role in the analysis. Knowing that Tweety 
is both a penguin and a bird renders Bel(T — f\T = (p PI b)) solely a function of m\{.) and to 2 (.), regardless of 
how penguins and birds are related. This stands contrary to common discourse, where people expect class properties to 
be overridden by properties of more specific subclasses. While in classical logic the three rules in our example would 
yield an unforgivable contradiction, the uncertainties attached to these rules, together with Dempster's normalization, 
now render them manageable. However, they are managed in the wrong way whenever we interpret if-then rules as 
randomized logical formulas of the material-implication type, instead of statements of conditional probabilities" . Keep 
in mind that this Pearl's statement is however given to show the semantic clash between the Dempster-Shafer reasoning 
vs. the fallacious Bayesian reasoning to support the Bayesian reasoning approach. 



5 The Dezert-Smarandache reasoning 

Before going further in our analysis, some clarification is necessary to explain to the reader the fundamental difference 
between the foundations of DSmT vs. DST. The DSmT can be easily viewed as a general flexible Bottom-Up approach 
for managing uncertainty and conflicts in fusion problems. It arises from the fact that the conflict between sources of 
evidence can come not only from the reliability of sources themselve (which can be handled quite easily by classical dis- 
counting methods) but also from a different interpretation of elements of the frame just because the sources or evidence 
have only a limited knowlege and provide their beliefs only with respect to their knowledge based usually on their own 
(local) experience, not to mention the fact that elements of the frame of the problem can truly be not refinable at all in 
some cases involving vague concepts like smallness/tallness, pleasure/pain, etc because of the continuous path from one to 
the other, etc. Based on this matter of fact, the DSmT proposes a new mathematical framework which starts at the bottom 
level (solid ground level) from the free DSm model and the notion of hyper-power set (Dedekind's lattice), then provides 
a general rule of combination to work with the free DSm model. Then it includes the possibility to take into account 
any kind of integrity constraints into the free DSm model if necessary through the hybrid DSm rule of combination. The 
taking into account for an integrity constraint consists just in forcing some elements of the Dedekind's lattice to be empty, 
just because they truly are for some given problems. 
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The introduction of an integrity constraint is like "pushing an elevator button" for going a bit up in the process of 
managing uncertainty and conflicts. If one needs to go higher, then one can take into account several integrity constraints 
as well in the framework of DSmT. If we finally wants to take into account all possible exclusivity constraints if we know 
that all elements of the frame of the given problem under consideration are truly exclusive, then we go directly to the Top 
level (the Shafer's model which serves as foundation for the DST). 

DSmT however can handle not only exclusivity constraints, but also existential constraints or mixed constraints as 
well which is helpful for some dynamic fusion problems. It is also important to emphaze that the hybrid DSm rule of 
combination is definitely not equivalent to the Dempster's rule of combination (and its alternatives based on the Top level) 
because one can stop and work at any level in the process of managing uncertainty and conflicts, depending on the nature 
of the problem. The hybrid DSm rule and Dempster's rule do not provide same results even if working with the Shafer's 
model as it will be proved in the sequel. The approach proposed by the DSmT to attack the fusion problem is totally new 
both by its foundations and the solution provided. 

The DSmT has been originally (ground-level) developed for the fusion of uncertain and paradoxical (highly conflict- 
ing) sources of information (bodies of evidences) based on the free DSm model M.' (9) which assumes that none of 
elements of the frame 9 are exclusive. This model is opposite to the Shafer's model. Let consider a free DSm model 
(9) with 9 = {6*i, . . . , #„}, the DSmT starts with the notion of hyper-power set D® defined as the set of all com- 
posite propositions built from elements of 9 with U and n (9 generates D e under operators U and n) operators such that 


1. 0,e 1) ...,0 n eD e . 

2. If A, B G D e , then A n B G D e and A U B G D B . 

3. No other elements belong to D , except those obtained by using rules 1 or 2. 

The cardinality of hyper-power set, d(n) = \D & \ for n > 1, follows the sequence of Dedekind's numbers 1, 2, 5, 19, 167, 
7580, 7828353, ... More details about the generation and partial ordering of elements of hyper-power set can be found in 
|4||5| l21l . From this model, authors have proposed a new simple associative and commutative rule of combination (the 
DSm classic rule) and then extended this rule to deal with any kind of hybrid models, i.e. sets 9 for which some proposi- 
tions/elements of D B are known or forced to be empty depending on the nature and the dynamicity of the fusion problem 
under consideration. In this framework, the Shafer's model appears only as a special hybrid model (the most constrained 
one, if we don't introduce existential constraints). The hybrid DSm fusion rule covers a wide class of fusion applications 
but is restricted to fusion of precise uncertain and paradoxical information only |21 1. We have recently extended this rule 
with new set operators for the fusion of imprecise, uncertain and paradoxical information - see |21 1 for details. 

We analyze here the Tweety penguin triangle problem with the DSmT. The prior knowledge characterized by the rules 
R = {ri,r 2 ,r 3 } and convictions W = {w X) w%, IU3} is modeled as three independent sources of evidence defined on 
separate minimal and potentially paradoxical (i.e internal conflicting) frames 9i = {p, /}, 92 = {b, /} and 93 = {p, b} 
since the rule 7*1 doesn't refer to the existence of b, the rule r 2 doesn't refer to the existence of p and the rule r 3 doesn't 
refer to the existence of / or /. Let's note that the DSmT doesn't require the refinement of frames as with DST (see 
previous section). We follow the same analysis as in previous section but now based on our DSm reasoning and the DSm 
rule of combination. 

The first source B\ relative to n with confidence w\ = 1 — ei provides us the conditional belief Beli(/|p) which 
is now defined from a paradoxical basic belief assignment mi(.) resulting from the DSm combination of m'lip) = 1 
with m[(.) defined on the hyper-power set D &1 — {0,p, f,p D f,p U /}. The choice for m' 1 (.) results directly from the 
derivation of the DSm rule and the application of the MCP. Indeed, the non null components of mi (.) are given by (we 
introduce explicitly the conditioning term in notation for convenience): 

1 1 

mi(p\p) = m'lip) m' x (p) + m"(p) m' x (p U /) 
1 1 

mi(p n f\p) = m'lip) m x {f) + m'{{p) m[(p n /) 

The information Beli (/|p) = 1 — ei implies 

Beli(/» = mi(/» + mi(p n f\p) = 1 - e x 

1 o 



Since mi (p\p) + mi(pfl f\p) = 1, one has necessarily mi (f\p) = and thus from previous equation mi (/ Dp \p) = 
1 — ei, which implies both 



"ii(p|p) = ei 

l l 
n = mi(p) m'^/) + m"(p) mi(p n /) 
= m' 1 (f)+m' 1 (pnf) 
= l-ei 

Applying the MCP, it results that one must choose 

m' 1 (/) = l-ei and m' 1 ( P n/)=0 
The sum of remaining masses of m'i(.) must be then equal to e±, i.e. 

m'i(p) + m[(pU f) = ei 
Applying again the MCP on this last constraint, one gets naturally 

m'i(p) = and m' 1 (p U /) = e\ 

Finally the belief assignment mi(.[p) relative to the source B\ and compatible with the constraint Beli (f\p) = 1 — ei, 
holds the same numerical values as within the DST analysis (see eqs. j20 M21» and is given by 

mi (p n f\p) = l-ei 
mi{p\p) = ei 

but results here from the DSm combination of the two following assignments (i.e. mi(.) = [mJffira'/K.) = [m'i'©m' 1 ](.)) 

fm'i(/) = l-6i and m[(pUf) = e 1 
\m'{(p) = l 

In a similarly manner and working on 82 = {b, /} for source B2 with the condition Bei2(/|&) = 1 — £2, the mass 
m-2 (. 1 6) results from the internal DSm combination of the two following assignments 

fm 2 (/) = l-e 2 and m' 2 (bUf) = e 2 
KO) = 1 

Similarly and working on 83 = {p, b} for source B3 with the condition Bei3(6|p) = 1 — 63, the mass m^{.\p) results 
from the internal DSm combination of the two following assignments 

(m' 3 (b) = l-e 3 and m' 3 (bUp) = e 3 
\m'i(p) = l 

It can be easily verified that these (less specific) basic belief assignments generates the conditions Beli (f\p) = 1 — ei, 
Bel 2 (/|6) = 1 - e 2 and Bel 3 (%) = 1 - e 3 . 

Now let's examine the result of the fusion of all these masses based on DSmT, i.e by applying the DSm rule of 
combination of the following basic belief assignments 



mi(p n f\p) = 


1 


- ei 


and 


mi(p\p) 


= (1 


m 2 {bnf\b) = 


1 


- 62 


and 


m 2 {b\b) 


= 62 


m 3 (p n b\p) 


1 


- 63 


and 


m 3 (p\p) 


= 63 



Note that these basic belief assignments turn to be identical to those drawn from DST framework analysis done in 
previous section for this specific problem because of integrity constraint / H / = and the MCP, but result actually from 
a slightly different and simpler analysis here drawn from DSmT. So we attack the TP2 with the same information as with 
the analysis based on DST, but we will show that a coherent conclusion can be drawn with DSm reasoning. 
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Let's emphasize now that one has to deal here with the hypotheses/elements p, b, f and / and thus our global frame 
is given by = {b,p, /, /}. Note that O doesn't satisfy the Shafer's model since the elements of are not all exclusive. 
This is a major difference between the foundations of DSmT with respect to the foundations of DST. But because only 
/ and / are truly exclusive, i.e. / n / = 0, we face a simple hybrid DSm model M. and thus the hybrid DSm fusion 
must apply rather than the classic DSm rule. We recall briefly here (a complete derivation, justification and examples can 
be found in [21 1) the hybrid DSm rule of combination associated to a given hybrid DSm model for k > 2 independent 
sources of information is defined for all A £ D B as: 



m Mm (A) 4 0(A) Si (A) + S 2 (A) + S 3 (A) 



(37) 



where <p(A) is the characteristic emptiness function of the set A, i.e. <j)(A) = 1 if A (0 = {0, 0m} being the set of 
all relatively and absolutely empty elements) and (f>(A) = otherwise, and 

k 

Si(A)± Y HMXi) (38) 

X!,X 2 ,...,X k <£D 1=1 
{X 1 nX 2 n...nX k )=A 

k 

S 2 (A)± y (39) 

Xx,x 2 ,...,x h e9 *=i 
[U=A]V[(U£0)A{A=h)] 

k 

Ss(A)± Y, U^(^) (40) 

x 1 ,x 2 ,...,x k eD° *=i 

(X 1 UX 2 U...UX k )=A 

(X!nx 2 n...nx k )e0 

with U = u(Xi) U u(X 2 ) U . . . U u(Xk) where u(X) is the union of all singletons 0j that compose X and I t = 
0i U #2 U . . . U 8 n is the total ignorance defined on the frame = {6*i, . . . , 8 n }. For example, if X is a singleton then 

u(X) = X; if X = 6 X n 2 or X = 9 1 U 6 2 then u{X) = X U 2 ; if X = (6 1 n 2 ) U 3 then u(Jf ) = 0i U 2 U 3 ; by 
convention tt(0) = 0. 

The first sum Si (A) entering in the previous formula corresponds to mass m M j r®} (A) obtained by the classic DSm 
rule of combination based on the free DSm model M.^ (i.e. on the free lattice D B ). The second sum 5*2 (A) entering in the 
formula of the hybrid DSm rule of combination d37i represents the mass of all relatively and absolutely empty sets which 
is transferred to the total or relative ignorances. The third sum Ss(A) entering in the formula of the hybrid DSm rule of 
combination d37l > transfers the sum of relatively empty sets to the non-empty sets in the same way as it was calculated 
following the DSm classic rule. 

To apply the DSm hybrid fusion rule formula < !37i . it is important to note that (pDf) D (6D/) Dp = pHbn f n / = 
because / n / = 0, thus the mass (1 — ei)(l — £2)^3 is transferred to the hybrid proposition Hi = (pfl /) U (6 Fif) Up = 
(b fl /) Up; similarly (p PI /) fl (b D f) H (p D b) = pDbD f C) f = because / n / = and therefore its associated mass 
(1 — ei)(l — ea)(l — e s) is transferred to the hybrid proposition H 2 = (pD f) U (6 n /) U (pfl b). No other mass transfer 
is necessary for this Tweety Penguin Triangle Problem and thus we finally get from DSm hybrid fusion formula J37i the 
following result for mi 2 3(.\p fl b) = [mi © m 2 © 7713] (.) (where © symbol corresponds here to the DSm fusion operator): 

mi23(Hi\p d b) = (1 -ei)(l - e 2 )e 3 

m 123 {H 2 \p n b) = (1 - ei)(l - e 2 )(l - e 3 ) 
"I123 (p n b n f\p n b) = (1 - ei)e 2 e 3 + (1 - ei)e 2 (l - e 3 ) 
toi 23 (p n b n f\p n b) = ei(l - e 2 )e 3 + ei(l - e 2 )(l - e 3 ) 
m 123 (p n b\p R b) = ei£ 2 e3 + £i£2(l - £3) 

with 

[Hi ^(bnf)u P 

\H 2 = bn/)u(6n/)u(pn6) 

It can be easily checked that these masses sum up to 1 . After elementary algebraic simplifications, one finally gets for 



I A 



the DSm fusion of all available prior information and reintroducing explicitly the conditioning term 



mua(Hi\p n b) = (1 - ei)(l - e 2 )e 3 

m 123 (H 2 \p n b) = (1 - ci)(l - e a )(l - es) 
m 123 (p H 6 n /|p n 6) = (1 - ei)e 2 
m 123 (pn6n/|pn6) = Cl (l -e 2 ) 
mi23(p n 6|p n 6) = eie 2 

We can check all these masses add up to 1 and that this result is fully coherent with the rational intuition specially 
when e 3 = 0, because non null components of mi 23 (.|p (~l b) reduces to 

m 123 (H 2 \ P nb) = (1 -e x )(l-e 2 ) 
wi 23 (p n b n f\p n 6) = (1 - ei)e 2 

TOi 23 ( P n6n/|pn6) = ei(i - e 2 ) 

^I23(pn6|pn6) = eie 2 

which means that from our DSm reasoning there is a strong uncertainty (due to the conflicting rules of our rule-based 
system), when t\ and e 2 remain small positive numbers, that a penguin-bird animal is either a penguin-nonflying animal 
or a bird-flying animal. The small value t\t 2 for m\ 23 {j> n b\p H 6) expresses adequately the fact that we cannot commit 
a strong basic belief assignment only to pDb knowing p n b just because one works on 9 = {p, b, f, /} and we cannot 
consider the property p n b solely because the "birdness" or "penguinness" property endow necessary either the flying or 
non-flying property. 

Therefore the belief that the particular observed penguin-bird animal Tweety (corresponding to the particular mass 
m (T — (p n b)) = 1) can be easily derived from the DSm fusion of all our prior summarized by mi 23 (.|p n b) and the 
available observation summarized by m (.) and we get 

m ol23 (T = n b n f)\T = (p n bj) = (1 - ei )e 2 

m i 23 (T= (pn&n/)|T= (pnh)) = ei (l-e 2 ) 
m i 23 (T= (pnb)\T= (pnb)) =eie 2 

m ol23 (T = H^T = (pn 6)) = (1 - ei)(l - e 2 )e 3 
m ol23 (T = H 2 \T = (p n 6)) = (1 - ei)(l - e 2 )(l - e 3 ) 

From the DSm reasoning, the belief that Tweety can fly is then given by 

Bel(T = f\T = (p n 6)) = rn ol23 (T = x\T=(pnb)) 

x£D ,xCf 

Using all the components of m D i 23 (.|T = (pD b)), one directly gets 

Bel(T = f\T = (p n 6)) = m ol23 (T = (/ n 6 n p)|T = (p n 6)) 

and finally 

Bel(T = /|T =(pn &)) = e x (l-e 2 ) (41) 
In a similar way, one will get for the belief that Tweety cannot fly 

Bel(T = f\T = (p n 6)) = e 2 (l - ei) (42) 

So now for both cases the beliefs remain very low which is normal and coherent with analysis done in section [3~2l 
Now let's examine the plausibilities of the ability for Tweety to fly or not to fly. These are given by 

Pl(T = f\T=(pnb))± J2 ™ i23(T = x\T=(pnb)) 

xeD e ,xr]f=^ 

Pl(T = /|T=(pnb))^ Yl ™ ol23 (T = x\T=(pnb)) 

which turn to be after elementary algebraic manipulations 

Pl(T = /|T=(pn&)) = (l-e 2 ) (43) 



H(T = /|T=(pn&)) = (l-ei) 



(44) 



So we conclude, as expected, that we can't decide on the ability for Tweety of flying or of not flying, since one has 
[Bel(/|p n 6), Pl(/|p n &)] = [6,(1 - e 2 ), (1 - e 2 )] « [0, 1] 

[Bel(/> n b), Pl(/> n b)} = [e 2 (l - ci), (1 - ei)] « [0, 1] 

Note that when setting t\ = and e 2 = 1 (or ei = 1 and e 2 = 0), i.e. one forces the full consistency of the initial 
rules-based system, one gets coherent result on the certainty of the ability of Tweety to not fly (or to fly respectively). 

This coherent result (radically different from the one based on Dempster-Shafer reasoning but starting with exactly 
the same available information) comes from the DSm hybrid fusion rule which transfers some parts of the mass of empty 
set m(0) = (1 — ei)(l — e 2 )e 3 + (1 — ei)(l — e 2 )(l — e 3 ) w 1 onto propositions H\ and iJ 2 . It is clear however that 
the high value of m(0) in this TP2 indicates a high conflicting fusion problem which proves that the TP2 is a truly almost 
impossible problem and the fusion result based on DSmT reasoning allows us to conclude on the true undecidability on 
the ability for Tweety of flying or of not flying. In other words, the fusion based on DSmT can be applied adequately on 
this almost impossible problem and concludes correctly on its undecidability. Another simplistic solution would consist 
to say naturally that the problem has to be considered as an impossible one just because m(0) > 0.5. 

6 Conclusion 

In this paper we have proposed a deep analysis of the challenging Tweety Penguin Triangle Problem. The analysis proves 
that the Bayesian reasoning cannot be mathematically justified to characterize the problem because the probabilistic 
model doesn't hold, even with the help of acceptance of the principle of indifference and the conditional independence 
assumption. Any conclusions drawn from such representation of the problem based on a hypothetical probabilistic model 
are based actually on a fallacious Bayesian reasoning. This is a fundamental result. Then one has shown how the 
Dempster-Shafer reasoning manages in what we feel is a wrong way the uncertainty and the conflict in this problem. We 
then proved that the DSmT can deal properly with this problem and provides a well-founded and reasonable conclusion 
about the undecidability of its solution. 
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